S.O.S. APPROXIMATION OF POLYNOMIALS, 
NONNEGATIVE ON A REAL ALGEBRAIC SET 



JEAN B. LASSERRE 

Abstract. With every real polynomial $f$, we associate a family $\{f_{\epsilon r}\}_{\epsilon,r}$
of real polynomials, in explicit form in terms of $f$ and the parameters
$\epsilon > 0$, $r \in \mathbb{N}$, and such that $\|f - f_{\epsilon r}\|_1 \to 0$ as $\epsilon \to 0$.

Let $V \subset \mathbb{R}^n$ be a real algebraic set described by finitely many
polynomial equations $g_j(x) = 0$, $j \in J$, and let $f$ be a real polynomial,
nonnegative on $V$. We show that for every $\epsilon > 0$, there exist nonnegative
scalars $\{\lambda_j(\epsilon)\}_{j \in J}$ such that, for all $r$ sufficiently large,

$f_{\epsilon r} + \sum_{j \in J} \lambda_j(\epsilon)\, g_j^2$ is a sum of squares.

This representation is an obvious certificate of nonnegativity of $f_{\epsilon r}$ on $V$,
and very specific in terms of the $g_j$ that define the set $V$. In particular, it
is valid with no assumption on $V$. In addition, this representation is also
useful from a computational point of view, as we can define semidefinite
programming relaxations to approximate the global minimum of $f$ on
a real algebraic set $V$, or a semi-algebraic set $\mathbb{K}$, and again, with no
assumption on $V$ or $\mathbb{K}$.



1. Introduction 

Let $V \subset \mathbb{R}^n$ be the real algebraic set

(1.1) $V := \{x \in \mathbb{R}^n \mid g_j(x) = 0,\ j = 1, \ldots, m\},$

for some family of real polynomials $\{g_j\} \subset \mathbb{R}[x]$ $(= \mathbb{R}[x_1, \ldots, x_n])$.

The main motivation of this paper is to provide a characterization of
polynomials $f \in \mathbb{R}[x]$, nonnegative on $V$, in terms of a certificate of positivity.
In addition, and in view of the many potential applications, one would like
to obtain a representation that is also useful from a computational point of
view.

In some particular cases, when $V$ is compact, and viewing the equations
$g_j(x) = 0$ as two opposite inequalities $g_j(x) \ge 0$ and $g_j(x) \le 0$, one may
obtain Schmüdgen's sum of squares (s.o.s.) representation [17] for $f + \epsilon$
($\epsilon > 0$), instead of $f$. Under an additional assumption on the $g_j$'s that
define $V$, the latter representation may even be refined to become the Putinar
[13] and Jacobi and Prestel [1] s.o.s. representation, that is, $f + \epsilon$ can be
written

(1.2) $f + \epsilon = f_0 + \sum_{j=1}^{m} f_j\, g_j,$

for some polynomials $\{f_j\} \subset \mathbb{R}[x]$, with $f_0$ a s.o.s. Hence, if $f$ is nonnegative
on $V$, every approximation $f + \epsilon$ of $f$ (with $\epsilon > 0$) has the representation
(1.2). The interested reader is referred to Marshall [10], Prestel and Delzell
[12], and Scheiderer [15, 16] for a nice account of such results.

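Contribution. We propose the following result: Let $\|f\|_1 = \sum_\alpha |f_\alpha|$
whenever $x \mapsto f(x) = \sum_\alpha f_\alpha x^\alpha$. Let $f \in \mathbb{R}[x]$ be nonnegative on $V$, as
defined in (1.1), and let $F := \{f_{\epsilon r}\}_{\epsilon,r}$ be the family of polynomials

(1.3) $f_{\epsilon r} = f + \epsilon \sum_{k=0}^{r} \sum_{j=1}^{n} \frac{x_j^{2k}}{k!}, \qquad \epsilon \ge 0,\ r \in \mathbb{N}.$

(So, for every $r \in \mathbb{N}$, $\|f - f_{\epsilon r}\|_1 \to 0$ as $\epsilon \downarrow 0$.)

Then, for every $\epsilon > 0$, there exist nonnegative scalars $\{\lambda_j(\epsilon)\}_{j=1}^{m}$ such
that, for all $r$ sufficiently large (say $r \ge r(\epsilon)$),

(1.4) $f_{\epsilon r} = q_\epsilon - \sum_{j=1}^{m} \lambda_j(\epsilon)\, g_j^2,$

for some s.o.s. polynomial $q_\epsilon \in \mathbb{R}[x]$, that is, $f_{\epsilon r} + \sum_{j=1}^{m} \lambda_j(\epsilon)\, g_j^2$ is s.o.s.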

Thus, with no assumption on the set $V$, one obtains a representation of
$f_{\epsilon r}$ (which is positive on $V$ as $f_{\epsilon r} > f$ for all $\epsilon > 0$) in the simple and explicit
form (1.4), an obvious certificate of positivity of $f_{\epsilon r}$ on $V$. In particular, when
$V = \mathbb{R}^n$, one retrieves the result of [9], which states that every nonnegative
real polynomial $f$ can be approximated as closely as desired by a family of
s.o.s. polynomials $\{f_{\epsilon r(\epsilon)}\}_\epsilon$, with $f_{\epsilon r}$ as in (1.3).

Notice that $f + n\epsilon = f_{\epsilon 0}$. So, on the one hand, the approximation $f_{\epsilon r}$ in
(1.4) is more complicated than $f + \epsilon$ in (1.2), valid for the compact case with
an additional assumption, but on the other hand, the coefficients of the $g_j$'s
in (1.4) are now scalars instead of s.o.s., and (1.4) is valid for an arbitrary
algebraic set $V$.
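As a quick numerical check of (1.3), the following sketch stores a polynomial as a dictionary from exponent tuples to coefficients. This representation and all function names are assumptions made here for illustration; they are not taken from the paper. The sketch verifies on one example that $\|f - f_{\epsilon r}\|_1 = \epsilon\, n \sum_{k=0}^{r} 1/k!$ (hence at most $\epsilon n e$) and that $f_{\epsilon 0} = f + n\epsilon$.

```python
from fractions import Fraction
from math import factorial


def theta_coeffs(n, r):
    """Coefficients of sum_{k=0}^r sum_{j=1}^n x_j^(2k)/k! as {exponents: coeff}."""
    coeffs = {}
    for k in range(r + 1):
        for j in range(n):
            alpha = tuple(2 * k if i == j else 0 for i in range(n))
            coeffs[alpha] = coeffs.get(alpha, Fraction(0)) + Fraction(1, factorial(k))
    return coeffs


def perturb(f, eps, r, n):
    """Coefficients of f_{eps r} = f + eps * theta_r, as in (1.3)."""
    out = dict(f)
    for alpha, c in theta_coeffs(n, r).items():
        out[alpha] = out.get(alpha, Fraction(0)) + eps * c
    return out


def l1_distance(f, g):
    """l1-norm of the coefficient vector of f - g."""
    return sum(abs(f.get(a, 0) - g.get(a, 0)) for a in set(f) | set(g))


# Illustrative example (hypothetical): f = (x1 - x2)^2 in n = 2 variables.
f = {(2, 0): 1, (1, 1): -2, (0, 2): 1}
eps = Fraction(1, 100)
dist = l1_distance(f, perturb(f, eps, 3, 2))  # eps * 2 * (1 + 1 + 1/2 + 1/6)
```

For fixed $r$ the distance is linear in $\epsilon$, which is the sense in which the family regularizes $f$ while staying arbitrarily close to it.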

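The case of a semi-algebraic set $\mathbb{K} = \{x \in \mathbb{R}^n \mid g_j(x) \ge 0,\ j = 1, \ldots, m\}$
reduces to the case of an algebraic set $V \subset \mathbb{R}^{n+m}$, by introducing $m$ slack
variables $\{z_j\}$, and replacing $g_j(x) \ge 0$ with $g_j(x) - z_j^2 = 0$, for all $j =
1, \ldots, m$. Let $f \in \mathbb{R}[x]$ be nonnegative on $\mathbb{K}$. Then, for every $\epsilon > 0$, there
exist nonnegative scalars $\{\lambda_j(\epsilon)\}_{j=1}^{m}$ such that, for all sufficiently large $r$,

$f + \epsilon \sum_{k=0}^{r} \sum_{i=1}^{n} \frac{x_i^{2k}}{k!} + \epsilon \sum_{k=0}^{r} \sum_{j=1}^{m} \frac{z_j^{2k}}{k!} = q_\epsilon - \sum_{j=1}^{m} \lambda_j(\epsilon)\, (g_j - z_j^2)^2,$

for some s.o.s. $q_\epsilon \in \mathbb{R}[x, z]$. Equivalently, everywhere on $\mathbb{K}$, the polynomial

$x \mapsto f(x) + \epsilon \sum_{k=0}^{r} \sum_{i=1}^{n} \frac{x_i^{2k}}{k!} + \epsilon \sum_{k=0}^{r} \sum_{j=1}^{m} \frac{g_j(x)^k}{k!}$

coincides with the polynomial $x \mapsto q_\epsilon(x_1, \ldots, x_n, \sqrt{g_1(x)}, \ldots, \sqrt{g_m(x)})$,
obviously nonnegative.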

The representation (1.4) is also useful for computational purposes.
Indeed, using (1.4), one can approximate the global minimum of $f$ on $V$ by
solving a sequence of semidefinite programming (SDP) problems. The same
applies to an arbitrary semi-algebraic set $\mathbb{K} \subset \mathbb{R}^n$, defined by $m$ polynomial
inequalities, as explained above. Again, and in contrast to previous
SDP-relaxation techniques as in e.g. [?], no compactness assumption
on $V$ or $\mathbb{K}$ is required.

In a sense, the family $F = \{f_{\epsilon r}\} \subset \mathbb{R}[x]$ (with $f_{0r} = f$) is a set of
regularizations of $f$, because one may approximate $f$ by members of $F$, and
those members always have nice representations when $f$ is nonnegative on
an algebraic set $V$ (including the case $V = \mathbb{R}^n$), whereas $f$ itself might not
have such a nice representation.

Methodology. To prove our main result, we proceed in three main steps.

1. We first define an infinite-dimensional linear programming problem
on an appropriate space of measures, whose optimal value is the global
minimum of $f$ on the set $V$.

2. We then prove a crucial result, namely that there is no duality gap
between this linear programming problem and its dual. The approach is
similar to, but different from, that taken in [9] when $V = \mathbb{R}^n$. Indeed, the
approach in [9] does not work when $V \ne \mathbb{R}^n$. Here, we use the important fact
that the polynomial $\theta_r$ is a moment function. And so, if a set of probability
measures $\Pi$ satisfies $\sup_{\mu \in \Pi} \int \theta_r\, d\mu < \infty$, it is tight, and therefore, by
Prohorov's theorem, relatively compact. This latter intermediate result is
crucial for our purpose.

3. In the final step, we use our recent result [9], which states that if a
polynomial $h \in \mathbb{R}[x]$ is nonnegative on $\mathbb{R}^n$, then $h + \epsilon\theta_r$ ($\epsilon > 0$) is a sum of
squares, provided that $r$ is sufficiently large.

The paper is organized as follows. After introducing the notation and
definitions in §2, some preliminary results are stated in §3, whereas our main
result is stated and discussed in §4. For clarity of exposition, most proofs
are postponed to §5, and some auxiliary results are stated in an Appendix;
in particular, duality results for linear programming in infinite-dimensional
spaces are briefly reviewed.

2. Notation and definitions 

Let $\mathbb{R}_+ \subset \mathbb{R}$ denote the cone of nonnegative real numbers. For a real
symmetric matrix $A$, the notation $A \succeq 0$ (resp. $A \succ 0$) stands for $A$ positive
semidefinite (resp. positive definite). The sup-norm $\sup_j |x_j|$ of a vector
$x \in \mathbb{R}^n$ is denoted by $\|x\|_\infty$. Let $\mathbb{R}[x]$ be the ring of real polynomials, and
let

(2.1) $v_r(x) := (1, x_1, x_2, \ldots, x_n, x_1^2, x_1 x_2, \ldots, x_1 x_n, x_2^2, x_2 x_3, \ldots, x_n^r)$

be the canonical basis for the $\mathbb{R}$-vector space $\mathcal{A}_r$ of real polynomials of
degree at most $r$, and let $s(r)$ be its dimension. Similarly, $v_\infty(x)$ denotes
the canonical basis of $\mathbb{R}[x]$ as an $\mathbb{R}$-vector space, denoted $\mathcal{A}$. So a vector in
$\mathcal{A}$ always has finitely many nonzero entries.

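Therefore, a polynomial $p \in \mathcal{A}_r$ is written

$x \mapsto p(x) = \sum_\alpha p_\alpha x^\alpha = \langle p, v_r(x) \rangle, \qquad x \in \mathbb{R}^n,$

(where $x^\alpha = x_1^{\alpha_1} x_2^{\alpha_2} \cdots x_n^{\alpha_n}$) for some vector $p = \{p_\alpha\} \in \mathbb{R}^{s(r)}$, the vector of
coefficients of $p$ in the basis (2.1).

Extending $p$ with zeros, we can also consider $p$ as a vector indexed in the
basis $v_\infty(x)$ (i.e. $p \in \mathcal{A}$). If we equip $\mathcal{A}$ with the usual scalar product $\langle \cdot, \cdot \rangle$
of vectors, then for every $p \in \mathcal{A}$,

$p(x) = \sum_\alpha p_\alpha x^\alpha = \langle p, v_\infty(x) \rangle, \qquad x \in \mathbb{R}^n.$

Given a sequence $y = \{y_\alpha\}$ indexed in the basis $v_\infty(x)$, let $L_y : \mathcal{A} \to \mathbb{R}$
be the linear functional

(2.2) $p \mapsto L_y(p) := \sum_\alpha p_\alpha y_\alpha = \langle p, y \rangle.$

Given a sequence $y = \{y_\alpha\}$ indexed in the basis $v_\infty(x)$, the moment
matrix $M_r(y) \in \mathbb{R}^{s(r) \times s(r)}$, with rows and columns indexed in the basis $v_r(x)$
in (2.1), satisfies

(2.3) $[\, M_r(y)(1, i) = y_\alpha \text{ and } M_r(y)(j, 1) = y_\beta \,] \ \Rightarrow\ M_r(y)(i, j) = y_{\alpha + \beta}.$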
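For instance, with $n = 2$,

$M_2(y) = \begin{bmatrix}
y_{00} & y_{10} & y_{01} & y_{20} & y_{11} & y_{02} \\
y_{10} & y_{20} & y_{11} & y_{30} & y_{21} & y_{12} \\
y_{01} & y_{11} & y_{02} & y_{21} & y_{12} & y_{03} \\
y_{20} & y_{30} & y_{21} & y_{40} & y_{31} & y_{22} \\
y_{11} & y_{21} & y_{12} & y_{31} & y_{22} & y_{13} \\
y_{02} & y_{12} & y_{03} & y_{22} & y_{13} & y_{04}
\end{bmatrix}.$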

A sequence $y = \{y_\alpha\}$ has a representing measure $\mu_y$ if

(2.4) $y_\alpha = \int x^\alpha\, d\mu_y, \qquad \forall \alpha \in \mathbb{N}^n.$

In this case one also says that $y$ is a moment sequence. In addition, if $\mu_y$ is
unique then $y$ is said to be a determinate moment sequence.
The matrix $M_r(y)$ defines a bilinear form $\langle \cdot, \cdot \rangle_y$ on $\mathcal{A}_r$, by

$\langle q, p \rangle_y := \langle q, M_r(y)\, p \rangle = L_y(qp), \qquad q, p \in \mathcal{A}_r,$

and if $y$ has a representing measure $\mu_y$, then

(2.5) $L_y(q^2) = \langle q, M_r(y)\, q \rangle = \int q(x)^2\, \mu_y(dx) \ge 0, \qquad \forall q \in \mathcal{A}_r,$

so that $M_r(y)$ is positive semidefinite, i.e., $M_r(y) \succeq 0$.
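To make (2.1), (2.3) and (2.5) concrete, here is a minimal sketch that builds the moment matrix of a Dirac measure. The function names and the graded ordering of exponents are illustrative assumptions (chosen to match the basis (2.1) for $n = 2$), not code from the paper. For a Dirac measure at $x$ one has $y_\alpha = x^\alpha$, so $M_r(y) = v_r(x)\, v_r(x)^T$ and $\langle q, M_r(y)\, q \rangle = q(x)^2$.

```python
from itertools import product
from math import prod


def monomials(n, r):
    """Exponent tuples of degree <= r, ordered by degree as in (2.1)."""
    exps = [a for a in product(range(r + 1), repeat=n) if sum(a) <= r]
    return sorted(exps, key=lambda a: (sum(a), tuple(-e for e in a)))


def dirac_moments(x, order):
    """Moments y_alpha = x^alpha (|alpha| <= order) of the Dirac measure at x."""
    return {a: prod(xi ** ai for xi, ai in zip(x, a))
            for a in monomials(len(x), order)}


def moment_matrix(y, n, r):
    """M_r(y)(i, j) = y_{alpha + beta}, rows and columns indexed by monomials(n, r)."""
    basis = monomials(n, r)
    return [[y[tuple(ai + bi for ai, bi in zip(a, b))] for b in basis]
            for a in basis]


def quadratic_form(q, M):
    """<q, M q> for a coefficient vector q given in the basis monomials(n, r)."""
    return sum(q[i] * M[i][j] * q[j] for i in range(len(q)) for j in range(len(q)))


# Illustrative data (hypothetical): Dirac measure at x = (2, 3), with r = 2.
x = (2, 3)
M = moment_matrix(dirac_moments(x, 4), 2, 2)
q = [1, -1, 2, 0, 1, 0]  # q(x) = 1 - x1 + 2*x2 + x1*x2
```

Here `quadratic_form(q, M)` equals $q(2,3)^2 = 11^2$, an instance of (2.5).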

3. Preliminaries 

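Let $V \subset \mathbb{R}^n$ be the real algebraic set defined in (1.1), and let $B_M$ be the
closed ball

(3.1) $B_M = \{x \in \mathbb{R}^n \mid \|x\|_\infty \le M\}.$

Proposition 3.1. Let $f \in \mathbb{R}[x]$ be such that $-\infty < f^* := \inf_{x \in V} f(x)$.
Then, for every $\epsilon > 0$, there is some $M_\epsilon \in \mathbb{N}$ such that

$f^*_M := \inf \{f(x) \mid x \in B_M \cap V\} < f^* + \epsilon, \qquad \forall M \ge M_\epsilon.$

Equivalently, $f^*_M \downarrow f^*$ as $M \to \infty$.

Proof. Suppose it is false. That is, there is some $\epsilon_0 > 0$ and an infinite
sequence $\{M_k\} \subset \mathbb{N}$, with $M_k \to \infty$, such that $f^*_{M_k} \ge f^* + \epsilon_0$ for
all $k$. But let $x_0 \in V$ be such that $f(x_0) < f^* + \epsilon_0$. With any $M_k \ge \|x_0\|_\infty$,
one obtains the contradiction $f^* + \epsilon_0 \le f^*_{M_k} \le f(x_0) < f^* + \epsilon_0$. □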
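Proposition 3.1 can be illustrated on a noncompact example chosen here for illustration (it does not appear in the paper): $V = \{x \in \mathbb{R}^2 \mid x_1 x_2 - 1 = 0\}$ and $f(x) = x_1^2$, so that $f^* = 0$ is not attained on $V$. Points of $V$ are $(t, 1/t)$, and they lie in $B_M$ exactly when $1/M \le |t| \le M$, so $f^*_M = 1/M^2 \downarrow 0$. The sketch below estimates $f^*_M$ on a grid; the function name and grid size are assumptions.

```python
def f_star_M(M, samples=10001):
    """Grid estimate of inf{ x1^2 : x in B_M and x1 * x2 = 1 } for M >= 1.

    V is parametrized by t -> (t, 1/t); by symmetry only t > 0 is scanned,
    and (t, 1/t) lies in B_M exactly when 1/M <= t <= M.
    """
    lo, hi = 1.0 / M, float(M)
    return min((lo + (hi - lo) * i / (samples - 1)) ** 2 for i in range(samples))


values = [f_star_M(M) for M in (1, 2, 4, 8)]  # decreasing, close to 1/M^2
```

The grid contains the endpoint $t = 1/M$, where the minimum is attained, so the estimate agrees with $1/M^2$ up to rounding.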

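For every $r \in \mathbb{N}$, let $\theta_r \in \mathbb{R}[x]$ be the polynomial

(3.2) $x \mapsto \theta_r(x) := \sum_{k=0}^{r} \sum_{j=1}^{n} \frac{x_j^{2k}}{k!},$

and notice that $n \le \theta_r(x) \le \sum_{j=1}^{n} e^{x_j^2} =: \theta_\infty(x)$, for all $x \in \mathbb{R}^n$. Moreover,
$\theta_r$ is a moment function, as it satisfies

(3.3) $\lim_{M \to \infty}\ \inf_{x \in B_M^c} \theta_r(x) = +\infty,$

where $B_M^c$ denotes the complement of $B_M$ in $\mathbb{R}^n$; see e.g. Hernández-Lerma
and Lasserre [3, p. 10].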
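The bounds on $\theta_r$ are easy to check numerically. The sketch below evaluates $\theta_r$ and $\theta_\infty$ directly from (3.2); the function names are illustrative assumptions. It also uses the elementary observation that, for $r \ge 1$, $\theta_r(x) \ge \theta_1(x) = n + \sum_j x_j^2 \ge n + \|x\|_\infty^2$, which is one way to see why (3.3) holds when $r \ge 1$.

```python
from math import exp, factorial


def theta(x, r):
    """theta_r(x) = sum_{k=0}^r sum_j x_j^(2k) / k!, as in (3.2)."""
    return sum(xj ** (2 * k) / factorial(k) for xj in x for k in range(r + 1))


def theta_inf(x):
    """theta_infinity(x) = sum_j exp(x_j^2)."""
    return sum(exp(xj * xj) for xj in x)


x = (0.5, -2.0, 1.0)  # arbitrary test point, n = 3
lower, value, upper = len(x), theta(x, 3), theta_inf(x)
```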

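Next, with $V$ as in (1.1), introduce the following optimization problems:

(3.4) $\mathbb{P}: \quad f^* := \inf_{x \in V} f(x),$

and for $0 < M \in \mathbb{N}$, $r \in \mathbb{N} \cup \{\infty\}$,

(3.5) $\mathcal{P}^r_M: \quad \inf_{\mu \in \mathcal{P}(\mathbb{R}^n)} \int f\, d\mu \quad \text{s.t.} \quad \int g_j^2\, d\mu \le 0,\ j = 1, \ldots, m; \quad \int \theta_r\, d\mu \le n e^{M^2},$

where $\mathcal{P}(\mathbb{R}^n)$ is the space of probability measures on $\mathbb{R}^n$ (with $\mathcal{B}$ its
associated Borel $\sigma$-algebra). The respective optimal values of $\mathbb{P}$ and $\mathcal{P}^r_M$ are
denoted $\inf \mathbb{P} = f^*$ and $\inf \mathcal{P}^r_M$, or $\min \mathbb{P}$ and $\min \mathcal{P}^r_M$ if the minimum is
attained (in which case, the problem is said to be solvable).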






Proposition 3.2. Let $f \in \mathbb{R}[x]$, and let $\mathbb{P}$ and $\mathcal{P}^r_M$ be as in (3.4) and (3.5),
respectively. Assume that $f^* > -\infty$. Then, for every $r \in \mathbb{N} \cup \{\infty\}$, $\inf \mathcal{P}^r_M \downarrow
f^*$ as $M \to \infty$. If $f$ has a global minimizer $x^* \in V$, then $\min \mathcal{P}^r_M = f^*$
whenever $M \ge \|x^*\|_\infty$.

Proof. When $M$ is sufficiently large, $B_M \cap V \ne \emptyset$, and so $\mathcal{P}^r_M$ is consistent,
and $\inf \mathcal{P}^r_M < \infty$. Let $\mu \in \mathcal{P}(\mathbb{R}^n)$ be admissible for $\mathcal{P}^r_M$. From $\int g_j^2\, d\mu \le 0$
for all $j = 1, \ldots, m$, it follows that $g_j(x)^2 = 0$ for $\mu$-almost all $x \in \mathbb{R}^n$,
$j = 1, \ldots, m$. That is, for every $j = 1, \ldots, m$, there exists a set $A_j \in \mathcal{B}$ such
that $\mu(A_j^c) = 0$ and $g_j(x) = 0$ for all $x \in A_j$. Take $A = \cap_j A_j \in \mathcal{B}$ so that
$\mu(A^c) = 0$, and for all $x \in A$, $g_j(x) = 0$ for all $j = 1, \ldots, m$. Therefore,
$A \subset V$, and as $\mu(A^c) = 0$,

$\int_{\mathbb{R}^n} f\, d\mu = \int_A f\, d\mu \ge f^*$ because $f \ge f^*$ on $A \subset V$,

which proves $\inf \mathcal{P}^r_M \ge f^*$.

As $V$ is closed and $B_M$ is closed and bounded, the set $B_M \cap V$ is compact
and so, with $f^*_M$ as in Proposition 3.1, there is some $\hat{x} \in B_M \cap V$ such that
$f(\hat{x}) = f^*_M$. In addition, let $\mu \in \mathcal{P}(\mathbb{R}^n)$ be the Dirac probability measure at
the point $\hat{x}$. As $\|\hat{x}\|_\infty \le M$,

$\int \theta_r\, d\mu = \theta_r(\hat{x}) \le n e^{M^2}.$

Moreover, as $\hat{x} \in V$, $g_j(\hat{x}) = 0$ for all $j = 1, \ldots, m$, and so

$\int g_j^2\, d\mu = g_j(\hat{x})^2 = 0, \qquad j = 1, \ldots, m,$

so that $\mu$ is an admissible solution of $\mathcal{P}^r_M$ with value $\int f\, d\mu = f(\hat{x}) = f^*_M$,
which proves that $\inf \mathcal{P}^r_M \le f^*_M$. This latter fact, combined with Proposition
3.1 and with $f^* \le \inf \mathcal{P}^r_M$, implies $\inf \mathcal{P}^r_M \downarrow f^*$ as $M \to \infty$, the desired result.
The final statement is immediate by taking as feasible solution for $\mathcal{P}^r_M$ the
Dirac probability measure at the point $x^* \in B_M \cap V$ (with $M \ge \|x^*\|_\infty$). As
its value is now $f^*$, it is also optimal, and so $\mathcal{P}^r_M$ is solvable with optimal
value $\min \mathcal{P}^r_M = f^*$. □

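Consider now the following optimization problem $\mathcal{Q}^r_M$, the dual problem
of $\mathcal{P}^r_M$, i.e.,

(3.6) $\mathcal{Q}^r_M: \quad \max_{\lambda, \delta, \gamma}\ \gamma - n \delta e^{M^2} \quad \text{s.t.} \quad f + \delta \theta_r + \sum_{j=1}^{m} \lambda_j g_j^2 \ge \gamma; \quad \gamma \in \mathbb{R},\ \delta \in \mathbb{R}_+,\ \lambda \in \mathbb{R}^m_+,$

with optimal value denoted by $\sup \mathcal{Q}^r_M$. Indeed, $\mathcal{Q}^r_M$ is a dual of $\mathcal{P}^r_M$ because
weak duality holds. To see this, consider any two feasible solutions $\mu \in
\mathcal{P}(\mathbb{R}^n)$ and $(\lambda, \delta, \gamma) \in \mathbb{R}^m_+ \times \mathbb{R}_+ \times \mathbb{R}$ of $\mathcal{P}^r_M$ and $\mathcal{Q}^r_M$, respectively. Then,
integrating both sides of the inequality in $\mathcal{Q}^r_M$ with respect to $\mu$ yields

$\int f\, d\mu + \delta \int \theta_r\, d\mu + \sum_{j=1}^{m} \lambda_j \int g_j^2\, d\mu \ \ge\ \gamma,$

and so, using that $\mu$ is feasible for $\mathcal{P}^r_M$,

$\int f\, d\mu \ \ge\ \gamma - \delta n e^{M^2}.$

Hence, the value of any feasible solution of $\mathcal{Q}^r_M$ is never larger than the
value of any feasible solution of $\mathcal{P}^r_M$, i.e., weak duality holds.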

In fact, we can obtain the following more important and crucial result.

Theorem 3.3. Let $M$ be large enough so that $B_M \cap V \ne \emptyset$. Let $f \in \mathbb{R}[x]$, and
let $r_0 \ge \max[\deg f, \deg g_j]$. Then, for every $r \ge r_0$, $\mathcal{P}^r_M$ is solvable, and there
is no duality gap between $\mathcal{P}^r_M$ and its dual $\mathcal{Q}^r_M$. That is, $\sup \mathcal{Q}^r_M = \min \mathcal{P}^r_M$.

For a proof see §5.1. We end this section by restating a result
proved in [9], which, together with Theorem 3.3, will be crucial to prove our
main result.

Theorem 3.4 ([9]). Let $f \in \mathbb{R}[x]$ be nonnegative. Then for every $\epsilon > 0$,
there is some $r(\epsilon) \in \mathbb{N}$ such that

(3.7) $f_{\epsilon r(\epsilon)}\ (= f + \epsilon \theta_{r(\epsilon)})$ is a sum of squares,

and so is $f_{\epsilon r}$, for all $r \ge r(\epsilon)$.

4. Main result 

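Recall that for given $(\epsilon, r) \in \mathbb{R} \times \mathbb{N}$, $f_{\epsilon r} = f + \epsilon \theta_r$, with $\theta_r \in \mathbb{R}[x]$ being
the polynomial defined in (3.2). We now state our main result:

Theorem 4.1. Let $V \subset \mathbb{R}^n$ be as in (1.1), and let $f \in \mathbb{R}[x]$ be nonnegative
on $V$. Then, for every $\epsilon > 0$, there exist $r(\epsilon) \in \mathbb{N}$ and nonnegative scalars
$\{\lambda_j\}_{j=1}^{m}$ such that, for all $r \ge r(\epsilon)$,

(4.1) $f_{\epsilon r} = q - \sum_{j=1}^{m} \lambda_j g_j^2,$

for some s.o.s. polynomial $q \in \mathbb{R}[x]$. In addition, $\|f - f_{\epsilon r}\|_1 \to 0$ as $\epsilon \downarrow 0$.

For a proof see §5.2.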

Remark 4.2. (i) Observe that (4.1) is an obvious certificate of positivity of
$f_{\epsilon r}$ on the algebraic set $V$, because everywhere on $V$, $f_{\epsilon r}$ coincides with the
s.o.s. polynomial $q$. Therefore, when $f$ is nonnegative on $V$, one obtains,
with no assumption on the algebraic set $V$, a certificate of positivity for
any approximation $f_{\epsilon r}$ of $f$ (with $r \ge r(\epsilon)$), whereas $f$ itself might not have
such a representation. In other words, the $(\epsilon, r)$-perturbation $f_{\epsilon r}$ of $f$ has
a regularization effect on $f$, as it makes it possible to derive nice representations.

(ii) From the proof of Theorem 4.1, instead of the representation (4.1),
one may also provide the alternative representation

$f_{\epsilon r} = q - \lambda \sum_{j=1}^{m} g_j^2,$

for some s.o.s. polynomial $q$, and some (single) nonnegative scalar $\lambda$ (instead
of $m$ nonnegative scalars in (4.1)).

4.1. The case of a semi-algebraic set. We now consider the representation
of polynomials, nonnegative on a semi-algebraic set $\mathbb{K} \subset \mathbb{R}^n$, defined
as

(4.2) $\mathbb{K} := \{x \in \mathbb{R}^n \mid g_j(x) \ge 0,\ j = 1, \ldots, m\},$

for some family $\{g_j\}_{j=1}^{m} \subset \mathbb{R}[x]$.

One may apply the machinery developed previously for algebraic sets,
because the semi-algebraic set $\mathbb{K}$ may be viewed as the projection on $\mathbb{R}^n$ of
an algebraic set in $\mathbb{R}^{n+m}$. Indeed, let $V \subset \mathbb{R}^{n+m}$ be the algebraic set defined
as

(4.3) $V := \{(x, z) \in \mathbb{R}^n \times \mathbb{R}^m \mid g_j(x) - z_j^2 = 0,\ j = 1, \ldots, m\}.$

Then every $x \in \mathbb{K}$ is associated with the point $(x, \sqrt{g_1(x)}, \ldots, \sqrt{g_m(x)}) \in V$.
Let $\mathbb{R}[z] := \mathbb{R}[z_1, \ldots, z_m]$, and $\mathbb{R}[x, z] := \mathbb{R}[x_1, \ldots, x_n, z_1, \ldots, z_m]$, and for
every $r \in \mathbb{N}$, let $\varphi_r \in \mathbb{R}[z]$ be the polynomial

(4.4) $z \mapsto \varphi_r(z) = \sum_{k=0}^{r} \sum_{j=1}^{m} \frac{z_j^{2k}}{k!}.$

We then get:

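Corollary 4.3. Let $\mathbb{K}$ be as in (4.2), and $\theta_r$, $\varphi_r$ be as in (3.2) and (4.4). Let
$f \in \mathbb{R}[x]$ be nonnegative on $\mathbb{K}$. Then, for every $\epsilon > 0$, there exist nonnegative
scalars $\{\lambda_j\}_{j=1}^{m}$ such that, for all $r$ sufficiently large,

(4.5) $f + \epsilon \theta_r + \epsilon \varphi_r = q_\epsilon - \sum_{j=1}^{m} \lambda_j\, (g_j - z_j^2)^2,$

for some s.o.s. polynomial $q_\epsilon \in \mathbb{R}[x, z]$.

Equivalently, everywhere on $\mathbb{K}$, the polynomial

(4.6) $x \mapsto f(x) + \epsilon \sum_{k=0}^{r} \sum_{i=1}^{n} \frac{x_i^{2k}}{k!} + \epsilon \sum_{k=0}^{r} \sum_{j=1}^{m} \frac{g_j(x)^k}{k!}$

coincides with the nonnegative polynomial $x \mapsto q_\epsilon(x, \sqrt{g_1(x)}, \ldots, \sqrt{g_m(x)})$.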
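The lifting behind (4.3) and (4.6) can be sketched numerically. In the example below, the constraint functions, the test point and all names are hypothetical choices made for illustration. A point $x \in \mathbb{K}$ is sent to $(x, \sqrt{g_1(x)}, \ldots, \sqrt{g_m(x)}) \in V$, and on that point $\varphi_r(z)$ equals $\sum_{k} \sum_{j} g_j(x)^k / k!$, which is why the left-hand side of (4.5) restricts to (4.6).

```python
from math import factorial, sqrt


def lift(x, gs):
    """Map x in K to (x, sqrt(g_1(x)), ..., sqrt(g_m(x))) in the lifted set V."""
    vals = [g(x) for g in gs]
    if min(vals) < 0:
        raise ValueError("x is not in K: some g_j(x) < 0")
    return tuple(x) + tuple(sqrt(v) for v in vals)


def phi(z, r):
    """phi_r(z) = sum_{k=0}^r sum_j z_j^(2k) / k!, as in (4.4)."""
    return sum(zj ** (2 * k) / factorial(k) for zj in z for k in range(r + 1))


# Hypothetical example: K = {x : 1 - x1^2 - x2^2 >= 0, x1 >= 0}, a half disk.
gs = [lambda x: 1 - x[0] ** 2 - x[1] ** 2, lambda x: x[0]]
x = (0.3, 0.4)
point = lift(x, gs)
z = point[len(x):]
phi_on_K = sum(g(x) ** k / factorial(k) for g in gs for k in range(5))
```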

So, as for the case of an algebraic set $V \subset \mathbb{R}^n$, (4.5) is an obvious certificate
of positivity on the semi-algebraic set $\mathbb{K}$, for the polynomial $f_{\epsilon r} \in \mathbb{R}[x, z]$,

$f_{\epsilon r} := f + \epsilon \theta_r + \epsilon \varphi_r,$

and in addition, viewing $f$ as an element of $\mathbb{R}[x, z]$, one has $\|f - f_{\epsilon r}\|_1 \to 0$
as $\epsilon \downarrow 0$. Notice that no assumption on $\mathbb{K}$, or on the $g_j$'s that define $\mathbb{K}$, is
needed.

Now, assume that $\mathbb{K}$ is compact and the $g_j$'s that define $\mathbb{K}$ satisfy
Putinar's condition, i.e., (i) there exists some $u \in \mathbb{R}[x]$ such that $u$ can be written
$u_0 + \sum_j u_j g_j$ for some s.o.s. polynomials $\{u_j\}_{j=0}^{m}$, and (ii) the level set
$\{x \mid u(x) \ge 0\}$ is compact.

If $f$ is nonnegative on $\mathbb{K}$, then $f + \epsilon \theta_r$ is strictly positive on $\mathbb{K}$, and
therefore, by Putinar's theorem [13],

(4.7) $f + \epsilon \theta_r = q_0 + \sum_{j=1}^{m} q_j g_j,$

for some s.o.s. family $\{q_j\}_{j=0}^{m}$. One may thus either have Putinar's
representation (4.7) in $\mathbb{R}^n$, or (4.5) via a lifting in $\mathbb{R}^{n+m}$.
One may relate (4.5) and (4.7) by

$q_\epsilon(x, z) = q^1_\epsilon(x) + q^2_\epsilon(x, z^2),$

with

$x \mapsto q^1_\epsilon(x) := q_0(x) + \sum_{j=1}^{m} \left( q_j(x)\, g_j(x) + \lambda_j\, g_j(x)^2 \right),$

and

$(x, z) \mapsto q^2_\epsilon(x, z^2) := \epsilon \varphi_r(z) + \sum_{j=1}^{m} \lambda_j \left( z_j^4 - 2 g_j(x)\, z_j^2 \right).$



4.2. Computational implications. The results of the previous section
can be applied to compute (or at least approximate) the global minimum of
$f$ on $V$. Indeed, with $\epsilon > 0$ fixed, and $2r \ge \max[\deg f, \deg g_j^2]$, consider the
convex optimization problem

(4.8) $\mathbb{Q}_{\epsilon r}: \quad \min_y\ L_y(f_{\epsilon r}) \quad \text{s.t.} \quad M_r(y) \succeq 0; \quad L_y(g_j^2) \le 0,\ j = 1, \ldots, m; \quad y_0 = 1,$

where $\theta_r$ is as in (3.2), and $L_y$ and $M_r(y)$ are the linear functional and the
moment matrix associated with a sequence $y$ indexed in the basis (2.1); see
(2.2) and (2.3) in §2.
$\mathbb{Q}_{\epsilon r}$ is called a semidefinite programming (SDP) problem, and its
associated dual SDP problem reads

(4.9) $\mathbb{Q}^*_{\epsilon r}: \quad \max_{\lambda, \gamma, q}\ \gamma \quad \text{s.t.} \quad f_{\epsilon r} - \gamma = q - \sum_{j=1}^{m} \lambda_j g_j^2; \quad \lambda \in \mathbb{R}^m,\ \lambda \ge 0; \quad q \in \mathbb{R}[x],\ q \text{ s.o.s. of degree} \le 2r.$

Their optimal values are denoted $\inf \mathbb{Q}_{\epsilon r}$ and $\sup \mathbb{Q}^*_{\epsilon r}$, respectively (or $\min \mathbb{Q}_{\epsilon r}$
and $\max \mathbb{Q}^*_{\epsilon r}$ if the optimum is attained, in which case the problems are said
to be solvable). Both problems $\mathbb{Q}_{\epsilon r}$ and its dual $\mathbb{Q}^*_{\epsilon r}$ are nice convex
optimization problems that, in principle, can be solved efficiently by standard
software packages. For more details on SDP theory, the interested reader is
referred to the survey paper [19].

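That weak duality holds between $\mathbb{Q}_{\epsilon r}$ and $\mathbb{Q}^*_{\epsilon r}$ is straightforward. Let
$y = \{y_\alpha\}$ and $(\lambda, \gamma, q) \in \mathbb{R}^m_+ \times \mathbb{R} \times \mathbb{R}[x]$ be feasible solutions of $\mathbb{Q}_{\epsilon r}$ and $\mathbb{Q}^*_{\epsilon r}$,
respectively. Then, by linearity of $L_y$ (and using $y_0 = 1$),

$L_y(f_{\epsilon r}) - \gamma = L_y(f_{\epsilon r} - \gamma)$
$\qquad = L_y\Big(q - \sum_{j=1}^{m} \lambda_j g_j^2\Big) = L_y(q) - \sum_{j=1}^{m} \lambda_j L_y(g_j^2)$
$\qquad \ge L_y(q)$ [because $L_y(g_j^2) \le 0$ for all $j = 1, \ldots, m$]
$\qquad \ge 0$ [because $q$ is s.o.s. and $M_r(y) \succeq 0$; see (2.5)].

Therefore, $L_y(f_{\epsilon r}) \ge \gamma$, the desired conclusion. Moreover, $\mathbb{Q}_{\epsilon r}$ is an obvious
relaxation of the perturbed problem

$\mathbb{P}_{\epsilon r}: \quad f^*_{\epsilon r} := \min_x\ \{f_{\epsilon r}(x) \mid x \in V\}.$

Indeed, let $x \in V$ and let $y := v_{2r}(x)$ (see (2.1)), i.e., $y$ is the vector of
moments (up to order $2r$) of the Dirac measure at $x \in V$. Then, $y$ is
feasible for $\mathbb{Q}_{\epsilon r}$ because $y_0 = 1$, $M_r(y) \succeq 0$, and $L_y(g_j^2) = g_j(x)^2 = 0$, for all
$j = 1, \ldots, m$. Similarly, $L_y(f_{\epsilon r}) = f_{\epsilon r}(x)$. Therefore, $\inf \mathbb{Q}_{\epsilon r} \le f^*_{\epsilon r}$.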
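The relaxation argument above can be checked on a small instance. The sketch below is illustrative only: the circle constraint, the test point and the function names are assumptions, and no SDP is solved. It forms the moment vector of a Dirac measure at a point of $V$ and evaluates the linear functional $L_y$ of (2.2), confirming $y_0 = 1$, $L_y(g^2) = g(x)^2 = 0$ and $L_y(p) = p(x)$.

```python
from fractions import Fraction
from itertools import product
from math import prod


def poly_mul(p, q):
    """Product of two polynomials stored as {exponent tuple: coefficient}."""
    out = {}
    for a, ca in p.items():
        for b, cb in q.items():
            c = tuple(i + j for i, j in zip(a, b))
            out[c] = out.get(c, 0) + ca * cb
    return out


def dirac_moments(x, order):
    """y_alpha = x^alpha for all |alpha| <= order (moments of the Dirac measure at x)."""
    return {a: prod(xi ** ai for xi, ai in zip(x, a))
            for a in product(range(order + 1), repeat=len(x)) if sum(a) <= order}


def riesz(p, y):
    """L_y(p) = sum_alpha p_alpha * y_alpha, as in (2.2)."""
    return sum(c * y[a] for a, c in p.items())


# Hypothetical example: V is the unit circle, g = x1^2 + x2^2 - 1.
g = {(2, 0): 1, (0, 2): 1, (0, 0): -1}
x = (Fraction(3, 5), Fraction(4, 5))  # a point of V
y = dirac_moments(x, 4)               # moments up to order 2r with r = 2
p = {(1, 0): 1, (0, 1): 1}            # p = x1 + x2
```

Exact rational arithmetic is used so that the equality $L_y(g^2) = 0$ holds exactly rather than up to rounding.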

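Theorem 4.4. Let $V \subset \mathbb{R}^n$ be as in (1.1), and $\theta_r$ as in (3.2). Assume that
$f$ has a global minimizer $x^* \in V$ with $f(x^*) = f^*$. Let $\epsilon > 0$ be fixed. Then

(4.10) $f^* \le \sup \mathbb{Q}^*_{\epsilon r} \le \inf \mathbb{Q}_{\epsilon r} \le f^* + \epsilon \theta_r(x^*) \le f^* + \epsilon \sum_{j=1}^{n} e^{(x^*_j)^2},$

provided that $r$ is sufficiently large.

Proof. Observe that the polynomial $f - f^*$ is nonnegative on $V$. Therefore,
by Theorem 4.1, for every $\epsilon$ there exist $r(\epsilon) \in \mathbb{N}$ and $\lambda(\epsilon) \in \mathbb{R}^m_+$ such that

$f_{\epsilon r} - f^* = q_\epsilon - \sum_{j=1}^{m} \lambda_j(\epsilon)\, g_j^2,$

for some s.o.s. polynomial $q_\epsilon \in \mathbb{R}[x]$. But this shows that $(\lambda(\epsilon), f^*, q_\epsilon) \in
\mathbb{R}^m_+ \times \mathbb{R} \times \mathbb{R}[x]$ is a feasible solution of $\mathbb{Q}^*_{\epsilon r}$ as soon as $r \ge r(\epsilon)$, in which
case, $\sup \mathbb{Q}^*_{\epsilon r} \ge f^*$. Moreover, we have seen that $\inf \mathbb{Q}_{\epsilon r} \le f_{\epsilon r}(x)$ for any
feasible solution $x \in V$. In particular, $\inf \mathbb{Q}_{\epsilon r} \le f^* + \epsilon \theta_r(x^*)$, from which
(4.10) follows. □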

Theorem 4.4 has a nice feature. Suppose that one knows some bound $\rho$ on
the norm $\|x^*\|_\infty$ of a global minimizer of $f$ on $V$. Then, one may fix a priori
the error bound $\eta$ on $|\inf \mathbb{Q}_{\epsilon r} - f^*|$. Indeed, let $\eta$ be fixed, and fix $\epsilon > 0$
such that $\epsilon \le \eta\, (n e^{\rho^2})^{-1}$. By Theorem 4.4, one has $f^* \le \inf \mathbb{Q}_{\epsilon r} \le f^* + \eta$,
provided that $r$ is large enough.
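The choice of $\epsilon$ described above is a one-line computation. The sketch below (function names are hypothetical) returns the largest admissible $\epsilon$ for a target error $\eta$, dimension $n$ and bound $\rho$, and checks on a sample point that $\epsilon \theta_r(x) \le \eta$ whenever $\|x\|_\infty \le \rho$.

```python
from math import exp, factorial


def epsilon_for(eta, n, rho):
    """Largest eps with eps * n * exp(rho^2) <= eta."""
    return eta / (n * exp(rho ** 2))


def theta(x, r):
    """theta_r(x) as in (3.2)."""
    return sum(xj ** (2 * k) / factorial(k) for xj in x for k in range(r + 1))


eta, n, rho = 1e-3, 2, 1.5
eps = epsilon_for(eta, n, rho)
x = (1.5, -1.2)                # any point with sup-norm at most rho
perturbation = eps * theta(x, 10)
```

Note that $\epsilon$ shrinks like $e^{-\rho^2}$, so a loose bound $\rho$ forces a very small perturbation.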

The same approach works to approximate the global minimum of a
polynomial $f$ on a semi-algebraic set $\mathbb{K}$, as defined in (4.2). In view of Corollary
4.3, and via a lifting in $\mathbb{R}^{n+m}$, one is reduced to the case of a real algebraic
set $V \subset \mathbb{R}^{n+m}$, so that Theorem 4.4 still applies. It is important to
emphasize that one requires no assumption on $\mathbb{K}$, or on the $g_j$'s that define $\mathbb{K}$.
This is to be compared with previous SDP-relaxation techniques developed
in e.g. [?], where the set $\mathbb{K}$ is supposed to be compact, and with
an additional assumption on the $g_j$'s to ensure that Putinar's representation
[13] holds.

5. Proofs 

5.1. Proof of Theorem 3.3. To prove the absence of a duality gap, we
first rewrite $\mathcal{P}^r_M$ (resp. $\mathcal{Q}^r_M$) as a linear program in (standard) form

$\min_x \{\langle x, c \rangle \mid Gx = b,\ x \in C\}$, (resp. $\max_w \{\langle w, b \rangle \mid c - G^* w \in C^*\}$),

on appropriate dual pairs of vector spaces, with associated convex cone $C$
(and its dual $C^*$), and associated linear map $G$ (and its adjoint $G^*$). Then,
we will prove that $G$ is continuous, and that the set $D := \{(Gx, \langle x, c \rangle) \mid x \in C\}$ is
closed, in some appropriate weak topology. This permits us to conclude by
invoking standard results in infinite-dimensional linear programming, which
one may find in e.g. Anderson and Nash [1]. For a brief account see §6.1,
and for more details, see e.g. Robertson and Robertson [14], and Anderson
and Nash [1].

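Let $\theta_r$ be as in (3.2), and let $M(\mathbb{R}^n)$ be the $\mathbb{R}$-vector space of finite signed
Borel measures $\mu$ on $\mathbb{R}^n$, such that $\int \theta_r\, d|\mu| < \infty$ (where $|\mu|$ denotes the
total variation of $\mu$). Similarly, let $H^r$ be the $\mathbb{R}$-vector space of continuous
functions $h : \mathbb{R}^n \to \mathbb{R}$, such that $\sup_{x \in \mathbb{R}^n} |h(x)| / \theta_r(x) < \infty$. With the
bilinear form $\langle \cdot, \cdot \rangle : M(\mathbb{R}^n) \times H^r \to \mathbb{R}$, defined as

$(\mu, h) \mapsto \langle \mu, h \rangle = \int h\, d\mu, \qquad (\mu, h) \in M(\mathbb{R}^n) \times H^r,$

$(M(\mathbb{R}^n), H^r)$ forms a dual pair of vector spaces (see §6.1). Introduce the
dual pair of vector spaces $(\mathcal{X}, \mathcal{Y})$,

$\mathcal{X} := M(\mathbb{R}^n) \times \mathbb{R}^m \times \mathbb{R}, \qquad \mathcal{Y} := H^r \times \mathbb{R}^m \times \mathbb{R},$

and $(\mathcal{Z}, \mathcal{W})$,

$\mathcal{Z} := \mathbb{R}^m \times \mathbb{R} \times \mathbb{R}, \qquad \mathcal{W} := \mathbb{R}^m \times \mathbb{R} \times \mathbb{R}.$

Recall that $2r \ge \deg g_j^2$, for all $j = 1, \ldots, m$, and let $G : \mathcal{X} \to \mathcal{Z}$ be the
linear map

$(\mu, u, v) \mapsto G(\mu, u, v) := \begin{bmatrix} \langle \mu, g_1^2 \rangle + u_1 \\ \vdots \\ \langle \mu, g_m^2 \rangle + u_m \\ \langle \mu, \theta_r \rangle + v \\ \langle \mu, 1 \rangle \end{bmatrix},$

with associated adjoint linear map $G^* : \mathcal{W} \to \mathcal{Y}$,

$(\lambda, \delta, \gamma) \mapsto G^*(\lambda, \delta, \gamma) := \begin{bmatrix} \sum_{j=1}^{m} \lambda_j g_j^2 + \delta \theta_r + \gamma \\ \lambda \\ \delta \end{bmatrix}.$

Next, let $M(\mathbb{R}^n)_+ \subset M(\mathbb{R}^n)$ be the convex cone of nonnegative finite Borel
measures on $\mathbb{R}^n$, so that the set $C := M(\mathbb{R}^n)_+ \times \mathbb{R}^m_+ \times \mathbb{R}_+ \subset \mathcal{X}$ is a convex
cone in $\mathcal{X}$. If $H^r_+$ denotes the nonnegative functions of $H^r$, then

$C^* = H^r_+ \times \mathbb{R}^m_+ \times \mathbb{R}_+ \subset \mathcal{Y}$

is the dual cone of $C$ in $\mathcal{Y}$.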

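As $2r \ge \max[\deg f, \deg g_j^2]$, it follows that $f \in H^r$ and $g_j^2 \in H^r$, for all
$j = 1, \ldots, m$. Then, by introducing slack variables $u \in \mathbb{R}^m_+$, $v \in \mathbb{R}_+$, rewrite
the infinite-dimensional linear program $\mathcal{P}^r_M$ defined in (3.5) in equality form,
that is,

(5.1) $\mathcal{P}^r_M: \quad \inf_{\mu, u, v}\ \langle (\mu, u, v), (f, 0, 0) \rangle \quad \text{s.t.} \quad G(\mu, u, v) = (0,\ n e^{M^2},\ 1); \quad (\mu, u, v) \in C.$

The LP dual $(\mathcal{P}^r_M)^*$ of $\mathcal{P}^r_M$ now reads

(5.2) $(\mathcal{P}^r_M)^*: \quad \sup_{\lambda, \delta, \gamma}\ \langle (\lambda, \delta, \gamma), (0,\ n e^{M^2},\ 1) \rangle \quad \text{s.t.} \quad (f, 0, 0) - G^*(\lambda, \delta, \gamma) \in C^*.$

Hence, every feasible solution $(\lambda, \delta, \gamma)$ of $(\mathcal{P}^r_M)^*$ satisfies

(5.3) $f - \sum_{j=1}^{m} \lambda_j g_j^2 - \delta \theta_r - \gamma \ge 0; \qquad \lambda, \delta \le 0.$

As $\lambda, \delta \le 0$ in (5.2), one may see that the two formulations (5.2) and (3.6)
are identical (up to the change of sign of $\lambda$ and $\delta$), i.e., $\mathcal{Q}^r_M = (\mathcal{P}^r_M)^*$.

As $2r \ge \max[\deg f, \deg g_j^2]$, it follows that $f - \sum_{j=1}^{m} \lambda_j g_j^2 - \delta \theta_r - \gamma \in H^r$,
for all $(\lambda, \delta, \gamma) \in \mathcal{W}$. Therefore, $G^*(\mathcal{W}) \subset \mathcal{Y}$, and so, by Proposition 6.2, the
linear map $G$ is weakly continuous (i.e. is continuous with respect to the
weak topologies $\sigma(\mathcal{X}, \mathcal{Y})$ and $\sigma(\mathcal{Z}, \mathcal{W})$).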

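We next prove that the set $D \subset \mathcal{Z} \times \mathbb{R}$, defined as

(5.4) $D := \{(G(\mu, u, v),\ \langle (\mu, u, v), (f, 0, 0) \rangle) \mid (\mu, u, v) \in C\},$

is weakly closed.

For some directed set $(A, \ge)$, let $\{(\mu_\beta, u_\beta, v_\beta)\}_{\beta \in A}$ be a net in $C$, such
that

$(G(\mu_\beta, u_\beta, v_\beta),\ \langle (\mu_\beta, u_\beta, v_\beta), (f, 0, 0) \rangle) \to ((a, b, c), d),$

weakly, for some element $((a, b, c), d) \in \mathcal{Z} \times \mathbb{R}$. In particular,

$\langle \mu_\beta, g_j^2 \rangle + (u_\beta)_j \to a_j,\ j = 1, \ldots, m; \qquad \langle \mu_\beta, \theta_r \rangle + v_\beta \to b; \qquad \mu_\beta(\mathbb{R}^n) \to c,$

and $\langle \mu_\beta, f \rangle \to d$. As $(\mu_\beta, u_\beta, v_\beta) \in C$, and $\theta_r, g_j^2 \ge 0$, it follows immediately
that $a, b, c \ge 0$. We need to consider the two cases $c = 0$ and $c > 0$.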

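Case $c = 0$. From $\mu_\beta(\mathbb{R}^n) \to c$, it follows that $\mu_\beta \to \mu := 0$ in the
total variation norm. But in this case, observe that $G(\mu, a, b) = (a, b, c)$.
It remains to prove that we also have $\langle \mu_\beta, f \rangle \to d = 0$, in which case
$(G(\mu, a, b), \langle \mu, f \rangle) = ((a, b, c), d)$, as desired.

Recall that $r \ge \deg f$. Denote by $\{y_\alpha(\beta)\}_{|\alpha| \le 2r}$ the sequence of moments
of the measure $\mu_\beta$, i.e.,

$y_\alpha(\beta) = \int x^\alpha\, d\mu_\beta, \qquad \alpha \in \mathbb{N}^n,\ |\alpha| \le 2r.$

In particular, $y_0(\beta) = \mu_\beta(\mathbb{R}^n)$. From $\langle \mu_\beta, \theta_r \rangle + v_\beta \to b$, there is some $\beta_0 \in A$
such that $\langle \mu_\beta, \theta_r \rangle \le 2b$ for all $\beta \ge \beta_0$. But this implies that

$y^{(i)}_{2k}(\beta) := \int x_i^{2k}\, d\mu_\beta \le 2\, r!\, b, \qquad k \le r,\ i = 1, \ldots, n.$

By the auxiliary lemma on moment sequences in §6.2, it follows that $y_{2\alpha}(\beta) \le 2 b\, r!$
for all $\alpha \in \mathbb{N}^n$ with $|\alpha| \le r$, and $|y_\alpha(\beta)| \le \sqrt{2\, y_0(\beta)\, b\, r!}$ for all $|\alpha| \le r$. But then,
as $y_0(\beta) = \mu_\beta(\mathbb{R}^n) \to c = 0$, we thus obtain $y_\alpha(\beta) \to 0$ for all $|\alpha| \le r$. Therefore,

$\langle \mu_\beta, f \rangle = \int f\, d\mu_\beta = \sum_\alpha f_\alpha \int x^\alpha\, d\mu_\beta = \sum_\alpha f_\alpha\, y_\alpha(\beta) \to 0,$

the desired result.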

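Case $c > 0$. From $\mu_\beta(\mathbb{R}^n) \to c$ and $\langle \mu_\beta, \theta_r \rangle + v_\beta \to b$, there is some $\beta_0 \in A$
such that $\mu_\beta(\mathbb{R}^n) \le 2c$ and $\langle \mu_\beta, \theta_r \rangle \le 2b$ for all $\beta \ge \beta_0$. But, as $\theta_r$ is a
moment function, this implies that the family $\Delta := \{\nu_\beta := \mu_\beta / \mu_\beta(\mathbb{R}^n)\}_{\beta \ge \beta_0}$
is a tight family of probability measures, and as $\Delta$ is a set of probability
measures on a metric space, by Prohorov's theorem, $\Delta$ is relatively compact
(see [3, Chap. 1] and Section 6.2). Therefore, there is some probability
measure $\nu^* \in M(\mathbb{R}^n)$, and a sequence $\{n_k\} \subset A$, such that $\nu_{n_k}$ converges to
$\nu^*$ for the weak convergence of probability measures, i.e.,

$\langle \nu_{n_k}, h \rangle \to \langle \nu^*, h \rangle, \qquad \forall h \in C_b(\mathbb{R}^n)$

(where $C_b(\mathbb{R}^n)$ denotes the space of bounded continuous functions on $\mathbb{R}^n$);
see e.g. Billingsley [2]. Hence, with $\mu^* := c\, \nu^*$, we also conclude

(5.5) $\langle \mu_{n_k}, h \rangle \to \langle \mu^*, h \rangle, \qquad \forall h \in C_b(\mathbb{R}^n).$

Next, as $2r \ge \max[\deg f, \deg g_j^2]$, the functions $f / \theta_{r-1}$ and $g_j^2 / \theta_{r-1}$, $j =
1, \ldots, m$, are all in $C_b(\mathbb{R}^n)$. Therefore, using Lemma 6.5, we obtain

$\langle \nu_{n_k}, f \rangle \to \langle \nu^*, f \rangle$, and $\langle \nu_{n_k}, g_j^2 \rangle \to \langle \nu^*, g_j^2 \rangle$, $j = 1, \ldots, m.$

And, therefore,

$\langle \mu_{n_k}, f \rangle \to \langle \mu^*, f \rangle = d$, and $\langle \mu_{n_k}, g_j^2 \rangle \to \langle \mu^*, g_j^2 \rangle$, $j = 1, \ldots, m.$

Finally, from the weak convergence (5.5), and as $\theta_r$ is continuous and
nonnegative,

$\langle \mu^*, \theta_r \rangle \le \liminf_{k \to \infty} \langle \mu_{n_k}, \theta_r \rangle \le b,$

see e.g. [3, Prop. 1.4.18].

So, let $v := b - \langle \mu^*, \theta_r \rangle \ge 0$, and $u_j := a_j - \langle \mu^*, g_j^2 \rangle \ge 0$, $j = 1, \ldots, m$,
and recalling that $c = \mu^*(\mathbb{R}^n)$, we conclude that $G(\mu^*, u, v) = (a, b, c)$, and
$\langle (\mu^*, u, v), (f, 0, 0) \rangle = d$, which proves that the set $D$ in (5.4) is weakly
closed.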

Finally, by Proposition 3.2, $\mathcal{P}^r_M$ is consistent with finite value as soon as
$M$ is large enough to ensure that $B_M \cap V \ne \emptyset$. Therefore, one may invoke
Theorem 6.4 and conclude that there is no duality gap between $\mathcal{P}^r_M$ and its
dual $\mathcal{Q}^r_M$, the desired result. □

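5.2. Proof of Theorem 4.1. It suffices to prove the result for the case
where $\inf_{x \in V} f(x) = f^* > 0$. Indeed, suppose that $f^* = 0$. Then with $\epsilon > 0$
fixed, arbitrary, $f^* + n\epsilon > 0$ and so, suppose that (4.1) holds for $\hat{f} := f + n\epsilon$.
There is some $r(\epsilon) \in \mathbb{N}$ such that, for all $r \ge r(\epsilon)$,

$\hat{f}_{\epsilon r} = f + n\epsilon + \epsilon \theta_r = q_{\epsilon r} - \sum_{j=1}^{m} \lambda_j g_j^2,$

for some s.o.s. polynomial $q_{\epsilon r}$, and some nonnegative scalars $\{\lambda_j\}$.
Equivalently,

$f + 2\epsilon \theta_r = q_{\epsilon r} + \epsilon \sum_{k=1}^{r} \sum_{j=1}^{n} \frac{x_j^{2k}}{k!} - \sum_{j=1}^{m} \lambda_j g_j^2 = \hat{q}_{\epsilon r} - \sum_{j=1}^{m} \lambda_j g_j^2,$

where $\hat{q}_{\epsilon r}$ is a s.o.s. polynomial. Equivalently, $f_{2\epsilon r} = \hat{q}_{\epsilon r} - \sum_{j=1}^{m} \lambda_j g_j^2$, so
that (4.1) also holds for $f$. Therefore, from now on, we will assume that
$f^* > 0$.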

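So let $\epsilon > 0$ (fixed) be such that $f^* - \epsilon > 0$, and let $r \ge r_0$ with $r_0$ as in
Theorem 3.3. Next, by Proposition 3.2, let $M$ be such that $f^* \le \inf \mathcal{P}^r_M \le
f^* + \epsilon$. By Theorem 3.3, we then have $\sup \mathcal{Q}^r_M \ge f^*$. So, by considering a
maximizing sequence of $\mathcal{Q}^r_M$, there is some $(\lambda, \delta, \gamma) \in \mathbb{R}^m_+ \times \mathbb{R}_+ \times \mathbb{R}$, such
that

(5.6) $0 < f^* - \epsilon \le \gamma - n \delta e^{M^2} \le f^* + \epsilon; \qquad f + \delta \theta_r + \sum_{j=1}^{m} \lambda_j g_j^2 \ge \gamma,$

and so,

(5.7) $f - (\gamma - n \delta e^{M^2}) + \sum_{j=1}^{m} \lambda_j g_j^2 \ \ge\ \delta\, (n e^{M^2} - \theta_r).$

By Proposition 3.1, we may choose $M$ such that there is some $x_M \in B_{M/2} \cap V$
such that $f(x_M) \le f^* + \epsilon$. Evaluating (5.7) at $x = x_M$ yields

(5.8) $2\epsilon \ \ge\ f(x_M) - (\gamma - n \delta e^{M^2}) \ \ge\ \delta\, (n e^{M^2} - \theta_r(x_M)),$

and so, using $\|x_M\|_\infty \le M/2$,

(5.9) $2\epsilon \ \ge\ \delta\, n\, (e^{M^2} - e^{M^2/4}),$

which yields $\delta \le 2\epsilon / \big(n (e^{M^2} - e^{M^2/4})\big)$. Therefore, given $\epsilon > 0$, one may pick
$(\lambda, \delta, \gamma)$ in a maximizing sequence of $\mathcal{Q}^r_M$, in such a way that $\delta \le \epsilon$.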
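The bound derived from (5.9) shows how quickly $\delta$ is forced to be small as $M$ grows: $\delta \le \epsilon$ holds as soon as $n (e^{M^2} - e^{M^2/4}) \ge 2$. The sketch below, with hypothetical function names, evaluates the bound and finds the smallest integer $M$ for which it guarantees $\delta \le \epsilon$. It only illustrates the inequality; in the proof, $M$ must also satisfy the requirements coming from Propositions 3.1 and 3.2.

```python
from math import exp


def delta_bound(eps, n, M):
    """Upper bound 2*eps / (n * (exp(M^2) - exp(M^2 / 4))) on delta, from (5.9)."""
    return 2 * eps / (n * (exp(M ** 2) - exp(M ** 2 / 4)))


def smallest_M(n):
    """Smallest integer M >= 1 with n * (exp(M^2) - exp(M^2 / 4)) >= 2."""
    M = 1
    while n * (exp(M ** 2) - exp(M ** 2 / 4)) < 2:
        M += 1
    return M


bound = delta_bound(0.1, 1, 2)  # n = 1, M = 2: far below eps = 0.1
```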
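For such a choice of $(\lambda, \delta, \gamma)$, and in view of (5.6), we have

$f + \delta \theta_r + \sum_{j=1}^{m} \lambda_j g_j^2 \ \ge\ (\gamma - n \delta e^{M^2}) + n \delta e^{M^2} \ \ge\ f^* - \epsilon + n \delta e^{M^2} \ >\ 0,$

so that the polynomial $h := f + \delta \theta_r + \sum_{j=1}^{m} \lambda_j g_j^2$ is nonnegative.

Therefore, invoking Theorem 3.4, proved in Lasserre [9], there is some
$r(\epsilon) \in \mathbb{N}$ such that, for all $s \ge r(\epsilon)$, the polynomial $q_\epsilon := h + \epsilon \theta_s$ is a s.o.s.
But then, take $s > \max[r, r(\epsilon)]$ and observe that

$\theta_r = \theta_s - \sum_{k=r+1}^{s} \sum_{j=1}^{n} \frac{x_j^{2k}}{k!},$

and so

$q_\epsilon = h + \epsilon \theta_s = f + \sum_{j=1}^{m} \lambda_j g_j^2 + (\delta + \epsilon)\, \theta_s - \delta \sum_{k=r+1}^{s} \sum_{j=1}^{n} \frac{x_j^{2k}}{k!},$

or, equivalently,

$f + \sum_{j=1}^{m} \lambda_j g_j^2 + (\delta + \epsilon)\, \theta_s = q_\epsilon + \delta \sum_{k=r+1}^{s} \sum_{j=1}^{n} \frac{x_j^{2k}}{k!} =: \hat{q}_\epsilon,$

where $\hat{q}_\epsilon$ is a s.o.s. polynomial.

As $\delta$ was chosen to satisfy $\delta \le \epsilon$, we obtain

$f + \sum_{j=1}^{m} \lambda_j g_j^2 + 2\epsilon\, \theta_s = \hat{q}_\epsilon + (\epsilon - \delta)\, \theta_s =: \tilde{q}_\epsilon,$

where again, $\tilde{q}_\epsilon$ is a s.o.s. polynomial. □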
6. Appendix

In this section, we first briefly recall some basic results of linear
programming in infinite-dimensional spaces, and then present auxiliary results that
are used in some of the proofs in §5.

6.1. Linear programming in infinite dimensional spaces. 

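6.1.1. Dual pairs. Let $\mathcal{X}, \mathcal{Y}$ be two arbitrary (real) vector spaces, and let
$\langle \cdot, \cdot \rangle$ be a bilinear form on $\mathcal{X} \times \mathcal{Y}$, that is, a real-valued function on $\mathcal{X} \times \mathcal{Y}$
such that

• the map $x \mapsto \langle x, y \rangle$ is linear on $\mathcal{X}$ for every $y \in \mathcal{Y}$;

• the map $y \mapsto \langle x, y \rangle$ is linear on $\mathcal{Y}$ for every $x \in \mathcal{X}$.

Then the pair $(\mathcal{X}, \mathcal{Y})$ is called a dual pair if the bilinear form separates
points in $\mathcal{X}$ and $\mathcal{Y}$, that is,

• for each $0 \ne x \in \mathcal{X}$, there is some $y \in \mathcal{Y}$ such that $\langle x, y \rangle \ne 0$, and

• for each $0 \ne y \in \mathcal{Y}$, there is some $x \in \mathcal{X}$ such that $\langle x, y \rangle \ne 0$.

Given a dual pair $(\mathcal{X}, \mathcal{Y})$, we denote by $\sigma(\mathcal{X}, \mathcal{Y})$ the weak topology on $\mathcal{X}$
(also referred to as the $\sigma$-topology on $\mathcal{X}$), namely the coarsest (or weakest)
topology on $\mathcal{X}$ under which all the elements of $\mathcal{Y}$ are continuous when
regarded as linear forms on $\mathcal{X}$.

Equivalently, the base of neighborhoods of the origin of the $\sigma$-topology is
the family of all sets of the form

$N(I, \epsilon) := \{x \in \mathcal{X} \mid |\langle x, y \rangle| \le \epsilon,\ \forall y \in I\},$

where $\epsilon > 0$ and $I$ is a finite subset of $\mathcal{Y}$. (See for instance Robertson and
Robertson [14, p. 32].) In this case, if $\{x_n\}$ is a net or a sequence in $\mathcal{X}$,
then $x_n$ converges to $x$ in the weak topology $\sigma(\mathcal{X}, \mathcal{Y})$ if

$\langle x_n, y \rangle \to \langle x, y \rangle, \qquad \forall y \in \mathcal{Y}.$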

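Definition 6.1. Let $(\mathcal{X}, \mathcal{Y})$ and $(\mathcal{Z}, \mathcal{W})$ be two dual pairs of vector spaces,
and $G : \mathcal{X} \to \mathcal{Z}$ a linear map.

(a) $G$ is said to be weakly continuous if it is continuous with respect
to the weak topologies $\sigma(\mathcal{X}, \mathcal{Y})$ and $\sigma(\mathcal{Z}, \mathcal{W})$; that is, if $\{x_n\}$ is a net in $\mathcal{X}$
such that $x_n \to x$ in the weak topology $\sigma(\mathcal{X}, \mathcal{Y})$, then $G x_n \to G x$ in the
weak topology $\sigma(\mathcal{Z}, \mathcal{W})$, i.e.,

$\langle G x_n, v \rangle \to \langle G x, v \rangle, \qquad \forall v \in \mathcal{W}.$

(b) The adjoint $G^* : \mathcal{W} \to \mathcal{Y}$ of $G$ is defined by the relation

$\langle G x, v \rangle = \langle x, G^* v \rangle, \qquad \forall x \in \mathcal{X},\ v \in \mathcal{W}.$

The following proposition gives a well-known (easy to use) criterion for
the map $G$ in Definition 6.1 to be weakly continuous.

Proposition 6.2. The linear map $G$ is weakly continuous if and only if its
adjoint $G^*$ maps $\mathcal{W}$ into $\mathcal{Y}$, that is, $G^*(\mathcal{W}) \subset \mathcal{Y}$.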
6.1.2. Positive and dual cones. Let $(\mathcal{X}, \mathcal{Y})$ be a dual pair of vector spaces,
and $C$ a convex cone in $\mathcal{X}$, that is, $x + x'$ and $\lambda x$ belong to $C$ whenever
$x$ and $x'$ are in $C$ and $\lambda > 0$. Unless explicitly stated otherwise, we shall
assume that $C$ is not the whole space, that is, $C \ne \mathcal{X}$, and that the origin
(the zero vector in $\mathcal{X}$) is in $C$. In this case, $C$ defines a partial order $\ge$ in
$\mathcal{X}$, such that

$x \ge x' \iff x - x' \in C,$

and $C$ is referred to as a positive cone in $\mathcal{X}$. The dual cone of $C$ is the
convex cone $C^*$ in $\mathcal{Y}$ defined by

$C^* := \{y \in \mathcal{Y} \mid \langle x, y \rangle \ge 0,\ \forall x \in C\}.$

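6.1.3. Infinite linear programming (LP). An infinite linear program requires
the following components:

• two dual pairs of vector spaces $(\mathcal{X}, \mathcal{Y})$ and $(\mathcal{Z}, \mathcal{W})$;

• a weakly continuous linear map $G : \mathcal{X} \to \mathcal{Z}$, with adjoint $G^* : \mathcal{W} \to \mathcal{Y}$;

• a positive cone $C$ in $\mathcal{X}$, with dual cone $C^*$ in $\mathcal{Y}$; and

• vectors $b \in \mathcal{Z}$ and $c \in \mathcal{Y}$.

Then the primal linear program is

\[ \mathbf{P}: \quad \text{minimize } \langle x, c\rangle \quad \text{subject to: } G x = b, \ x \in C. \tag{6.1} \]

The corresponding dual linear program is

\[ \mathbf{P}^*: \quad \text{maximize } \langle b, w\rangle \quad \text{subject to: } c - G^* w \in C^*, \ w \in \mathcal{W}. \tag{6.2} \]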
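
To make the primal and dual programs concrete, here is a minimal sketch of a finite-dimensional instance. Everything in it is an assumption made for illustration and does not come from the text: the spaces are R^2 and R with the usual dot product as pairing, the cone C is the nonnegative orthant (so C* = C), and the data `G`, `b`, `c` as well as the helper names are hypothetical.

```python
# Hypothetical finite-dimensional instance of the primal/dual pair:
# X = Y = R^2, Z = W = R, <u, v> = dot product, C = C* = nonnegative orthant.
G = [[1.0, 1.0]]   # linear map G : R^2 -> R, stored as a 1 x 2 matrix
b = [1.0]          # right-hand side in Z
c = [1.0, 2.0]     # cost vector in Y

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def apply_G(x):
    return [dot(row, x) for row in G]

def apply_G_adjoint(w):
    # In finite dimensions the adjoint is the transpose: <Gx, w> = <x, G^T w>.
    return [sum(G[i][j] * w[i] for i in range(len(G))) for j in range(len(G[0]))]

def primal_feasible(x, tol=1e-12):
    # Gx = b and x in C
    in_cone = all(xi >= -tol for xi in x)
    on_affine = all(abs(gi - bi) <= tol for gi, bi in zip(apply_G(x), b))
    return in_cone and on_affine

def dual_feasible(w, tol=1e-12):
    # c - G^* w in C^*
    slack = [ci - gi for ci, gi in zip(c, apply_G_adjoint(w))]
    return all(s >= -tol for s in slack)

x_opt = [1.0, 0.0]   # candidate primal solution
w_opt = [1.0]        # candidate dual solution
primal_value = dot(x_opt, c)
dual_value = dot(b, w_opt)
```

For this toy instance both candidates are feasible and the two values coincide, so each candidate is optimal for its program.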

An element $x \in \mathcal{X}$ is called feasible for $\mathbf{P}$ if it satisfies the constraints in (6.1), and $\mathbf{P}$ is said
to be consistent if it has a feasible solution. If $\mathbf{P}$ is consistent then its value
is defined as

\[ \inf \mathbf{P} := \inf \{\, \langle x, c\rangle \;|\; x \text{ is feasible for } \mathbf{P} \,\}; \]

otherwise, $\inf \mathbf{P} = +\infty$. The linear program $\mathbf{P}$ is solvable if there is some
feasible solution $x^* \in \mathcal{X}$ that achieves the value $\inf \mathbf{P}$; then $x^*$ is an optimal
solution of $\mathbf{P}$, and one then writes $\inf \mathbf{P} = \min \mathbf{P}$. The same definitions
apply for the dual linear program $\mathbf{P}^*$.

The next result can be proved as in elementary (finite-dimensional) LP. 

Proposition 6.3 (Weak duality). If $\mathbf{P}$ and $\mathbf{P}^*$ are both consistent, then their
values are finite and satisfy $\sup \mathbf{P}^* \le \inf \mathbf{P}$.

There is no duality gap if $\sup \mathbf{P}^* = \inf \mathbf{P}$, and strong duality holds if
$\max \mathbf{P}^* = \min \mathbf{P}$, i.e., if there is no duality gap and both $\mathbf{P}^*$ and $\mathbf{P}$ are
solvable.
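
The text does not spell out the argument; a sketch of the standard one, using only the definitions of the adjoint and of the dual cone, runs as follows for any $x$ feasible for the primal program and any $w$ feasible for the dual program:

```latex
\[
  \langle x, c\rangle - \langle b, w\rangle
  \;=\; \langle x, c\rangle - \langle G x, w\rangle
  \;=\; \langle x, c\rangle - \langle x, G^* w\rangle
  \;=\; \langle x,\, c - G^* w\rangle \;\ge\; 0,
\]
% first equality: Gx = b; second: definition of the adjoint G^*;
% last inequality: x \in C and c - G^* w \in C^*.
```

Taking the infimum over feasible $x$ and the supremum over feasible $w$ gives the stated inequality, and also shows that both values are finite.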

Theorem 6.4. Let $D$ be the set in $\mathcal{Z} \times \mathbb{R}$ defined as

\[ D := \{\, (G x, \langle x, c\rangle) \;|\; x \in C \,\}. \tag{6.3} \]

If $\mathbf{P}$ is consistent with finite value, and $D$ is weakly closed (i.e., closed in the
weak topology $\sigma(\mathcal{Z} \times \mathbb{R}, \mathcal{W} \times \mathbb{R})$), then $\mathbf{P}$ is solvable and there is no duality
gap, i.e., $\sup \mathbf{P}^* = \min \mathbf{P}$.

(See Anderson and Nash, Theorems 3.10 and 3.22.)

6.2. Auxiliary results. Let $\mathcal{B}$ be the Borel sigma-algebra of $\mathbb{R}^n$, $C_0(\mathbb{R}^n)$
be the space of bounded continuous functions that vanish at infinity, and let
$\theta_r$ be as in (3.2). Let $M(\mathbb{R}^n)$ be the space of finite signed Borel measures
on $\mathbb{R}^n$.

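Lemma 6.5. Let $r \ge 1$, and let $\{\mu_j\}_{j \in J} \subset M(\mathbb{R}^n)$ be a sequence of probability
measures such that

\[ \sup_{j \in J} \int \theta_r \, d\mu_j < \infty. \tag{6.4} \]

Then there is a subsequence $\{j_k\} \subset J$ and a probability measure $\mu$ on $\mathbb{R}^n$
(not necessarily in M), such that

\[ \int f \, d\mu_{j_k} \to \int f \, d\mu \quad \text{as } k \to \infty, \]

for all continuous functions $f : \mathbb{R}^n \to \mathbb{R}$ such that $f/\theta_{r-1} \in C_b(\mathbb{R}^n)$.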



Proof. $\theta_r$ is a moment function (see (3.3)), and so (6.4) implies that the
sequence $\{\mu_j\}$ is tight. Hence, as $\mathbb{R}^n$ is a metric space, by Prohorov's Theorem
[5, Theorem 1.4.12], there is a subsequence $\{j_k\} \subset J$ and a measure
$\mu \in M(\mathbb{R}^n)$ such that $\mu_{j_k} \Rightarrow \mu$, i.e.,

\[ \int h \, d\mu_{j_k} \to \int h \, d\mu, \tag{6.5} \]

for all $h \in C_b(\mathbb{R}^n)$, with $C_b(\mathbb{R}^n)$ being the space of bounded continuous
functions $h : \mathbb{R}^n \to \mathbb{R}$. Next, let $\nu_{j_k}$ be the measure obtained from $\mu_{j_k}$ by:

\[ \nu_{j_k}(B) := \int_B \theta_{r-1} \, d\mu_{j_k}, \quad B \in \mathcal{B}. \]

Observe that, from the definition of $\theta_r$, the function $\theta_r/\theta_{r-1}$ is a moment
function for every $r \ge 1$. Moreover, one has

\[ \sup_k \int (\theta_r/\theta_{r-1}) \, d\nu_{j_k} = \sup_k \int \theta_r \, d\mu_{j_k} < \infty, \]

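because of (6.4). Observe that $\nu_{j_k}(\mathbb{R}^n)$ is bounded uniformly in $k$, and so we may consider
a subsequence of $\{j_k\}$ (still denoted $\{j_k\}$ for simplicity of notation) such that
$\nu_{j_k}(\mathbb{R}^n) \to \rho$ $(> 0)$ as $k \to \infty$. With $\hat\nu_{j_k} := \nu_{j_k}/\nu_{j_k}(\mathbb{R}^n)$, for all $k$, it follows
that the sequence of probability measures $\{\hat\nu_{j_k}\}_k$ is tight, which implies that
there is a subsequence $\{j_n\}$ of $\{j_k\}$, and a measure $\hat\nu \in M(\mathbb{R}^n)$, such that

\[ \text{as } n \to \infty, \quad \int h \, d\hat\nu_{j_n} \to \int h \, d\hat\nu, \quad \forall h \in C_b(\mathbb{R}^n). \]

Since $\nu_{j_k}(\mathbb{R}^n) \to \rho$ as $k \to \infty$, we immediately get

\[ \int h \, d\nu_{j_n} = \int h \, \bigl(\rho + \nu_{j_n}(\mathbb{R}^n) - \rho\bigr) \, d\hat\nu_{j_n} \to \int h \, \rho \, d\hat\nu, \quad \text{as } n \to \infty, \]

for all $h \in C_b(\mathbb{R}^n)$. Equivalently, with $\nu := \rho\,\hat\nu$,

\[ \text{as } n \to \infty, \quad \int h \, d\nu_{j_n} \to \int h \, d\nu, \quad \forall h \in C_b(\mathbb{R}^n). \tag{6.6} \]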
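But as $h/\theta_{r-1} \in C_b(\mathbb{R}^n)$ whenever $h \in C_b(\mathbb{R}^n)$, (6.6) yields

\[ \int (h/\theta_{r-1}) \, d\nu = \lim_{n \to \infty} \int (h/\theta_{r-1}) \, d\nu_{j_n} = \lim_{n \to \infty} \int h \, d\mu_{j_n} = \int h \, d\mu, \]

for all $h \in C_b(\mathbb{R}^n)$.

As both $\mu$ and $\theta_{r-1}^{-1} \, d\nu$ are finite measures, this implies that

\[ \mu(B) := \int_B (1/\theta_{r-1}) \, d\nu, \quad B \in \mathcal{B}. \tag{6.7} \]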

As the subsequence $\{j_n\}$ was arbitrary, it thus follows that the whole
subsequence $\{\nu_{j_k}\}$ converges weakly to $\nu$.

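Next, let $f : \mathbb{R}^n \to \mathbb{R}$ be continuous and such that $f/\theta_{r-1} \in C_b(\mathbb{R}^n)$. As
$k \to \infty$, from (6.6),

\[ \int (f/\theta_{r-1}) \, d\nu_{j_k} \to \int (f/\theta_{r-1}) \, d\nu, \]

and so,

\[ \int f \, d\mu_{j_k} = \int (f/\theta_{r-1}) \, \theta_{r-1} \, d\mu_{j_k} = \int (f/\theta_{r-1}) \, d\nu_{j_k} \to \int (f/\theta_{r-1}) \, d\nu = \int f \, d\mu \quad [\text{by (6.7)}], \]

the desired result. □
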
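Lemma 6.6. Let $\mu$ be a measure on $\mathbb{R}^n$ (with $\mu(\mathbb{R}^n) = y_0$) such that

\[ \sup_{i = 1, \dots, n} \ \sup_{0 \le k \le r} \int x_i^{2k} \, d\mu \le S. \tag{6.8} \]

Then,

\[ \sup_{\alpha \in \mathbb{N}^n;\ |\alpha| \le r} \left| \int x^\alpha \, d\mu \right| \le \sqrt{y_0 S}. \tag{6.9} \]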

Proof. Let $y = \{y_\alpha\}_{|\alpha| \le 2r}$ be the sequence of moments, up to order $2r$, of the
measure $\mu$, and let $M_r(y)$ be the moment matrix defined in (2.3), associated
with $\mu$. Then, (6.8) means that those diagonal elements of $M_r(y)$ that
correspond to the moments $\int x_i^{2k} \, d\mu$ (in the notation of Lasserre) are all
bounded by $S$. Therefore, by Lemma 6.2 of the same reference,
all diagonal elements of $M_r(y)$ are also bounded by $S$, i.e.,

\[ y_{2\alpha} \le S, \quad \forall \alpha \in \mathbb{N}^n, \ |\alpha| \le r, \tag{6.10} \]

and so are all elements of $M_r(y)$ (because $M_r(y) \succeq 0$). Next, consider the
two columns (and rows) $1$ and $j$, associated with the monomials $1$ and $x^\alpha$,
respectively, and with $|\alpha| \le r$, that is, $M_r(y)(1,1) = y_0$ and $M_r(y)(1,j) = y_\alpha$.
As $M_r(y) \succeq 0$, we immediately have

\[ M_r(y)(1,1) \times M_r(y)(j,j) \ \ge\ M_r(y)(1,j)\, M_r(y)(j,1) = M_r(y)(1,j)^2, \]

that is, $y_0 \, y_{2\alpha} \ge y_\alpha^2$. Using that $|\alpha| \le r$ and (6.10), we obtain $y_0 S \ge y_\alpha^2$ for
all $\alpha$ with $|\alpha| \le r$, the desired result (6.9). □
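
The bound (6.9) can be sanity-checked numerically on a finitely supported measure. The sketch below is only an illustration under stated assumptions: the support points, the weights and the function names are hypothetical, the index range in (6.8) is read as 0 <= k <= r, and S is taken as the smallest constant for which (6.8) holds.

```python
import itertools
import math

def moment(points, weights, alpha):
    """y_alpha = integral of x^alpha for the measure sum_j weights[j] * delta_{points[j]}."""
    return sum(w * math.prod(x ** a for x, a in zip(p, alpha))
               for p, w in zip(points, weights))

def lemma_6_6_check(points, weights, r):
    """Return (largest |y_alpha| over |alpha| <= r, sqrt(y_0 * S))."""
    n = len(points[0])
    y0 = moment(points, weights, (0,) * n)
    # Smallest S satisfying (6.8): largest moment of x_i^(2k), 0 <= k <= r (assumed range).
    S = max(moment(points, weights, tuple(2 * k if j == i else 0 for j in range(n)))
            for i in range(n) for k in range(r + 1))
    # Largest absolute moment over all multi-indices alpha with |alpha| <= r.
    worst = max(abs(moment(points, weights, alpha))
                for alpha in itertools.product(range(r + 1), repeat=n)
                if sum(alpha) <= r)
    return worst, math.sqrt(y0 * S)

points = [(1.0, 2.0), (-0.5, 0.3), (0.0, -1.5)]   # hypothetical support points in R^2
weights = [0.2, 0.5, 0.3]                         # hypothetical positive masses
worst, bound = lemma_6_6_check(points, weights, r=2)
```

Comparing `worst` with `bound` for a few choices of points, weights and `r` gives a quick check that the left-hand side of (6.9) stays below the right-hand side, up to floating-point rounding.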